- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources2
- Resource Type
-
0001000001000000
- More
- Availability
-
20
- Author / Contributor
- Filter by Author / Creator
-
-
Carsello, Alex (2)
-
Chen, Po-Han (2)
-
Horowitz, Mark (2)
-
Kong, Taeyoung (2)
-
Koul, Kalhan (2)
-
Liu, Qiaoyi (2)
-
Melchert, Jackson (2)
-
Nyengele, Gedeon (2)
-
Raina, Priyanka (2)
-
Strange, Maxwell (2)
-
Torng, Christopher (2)
-
Zhang, Keyi (2)
-
Adivarahan, Jayashree (1)
-
Balasingam, Akhilesh (1)
-
Bhagdikar, Nikhil (1)
-
D’Agostino, Brandon (1)
-
Emer, Joel (1)
-
Feng, Kathleen (1)
-
Hsu, Olivia (1)
-
Joshi, Pranil (1)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Feng, Kathleen; Kong, Taeyoung; Koul, Kalhan; Melchert, Jackson; Carsello, Alex; Liu, Qiaoyi; Nyengele, Gedeon; Strange, Maxwell; Zhang, Keyi; Nayak, Ankita; et al (, IEEE Journal of Solid-State Circuits)Amber is a system-on-chip (SoC) with a coarse-grained reconfigurable array (CGRA) for acceleration of dense linear algebra applications, such as machine learning (ML), image processing, and computer vision. It is designed using an agile accelerator-compiler co-design flow; the compiler updates automatically with hardware changes, enabling continuous application-level evaluation of the hardware-software system. To increase hardware utilization and minimize reconfigurability overhead, Amber features the following: 1) dynamic partial reconfiguration (DPR) of the CGRA for higher resource utilization by allowing fast switching between applications and partitioning resources between simultaneous applications; 2) streaming memory controllers supporting affine access patterns for efficient mapping of dense linear algebra; and 3) low-overhead transcendental and complex arithmetic operations. The physical design of Amber features a unique clock distribution method and timing methodology to efficiently layout its hierarchical and tile-based design. Amber achieves a peak energy efficiency of 538 INT16 GOPS/W and 483 BFloat16 GFLOPS/W. Compared with a CPU, a GPU, and a field-programmable gate array (FPGA), Amber has up to 3902x, 152x, and 107x better energy-delay product (EDP), respectively.more » « less
An official website of the United States government
